Learning to identify video shots with people based on face detection
Abstract
We examine how to identify video shots with at least two humans using only detected face information. Face detection is much more reliable than shape-based people classification in broadcast video, but it has one particular difficulty: when there are several humans in an image, the accuracy of face detection is usually significantly degraded, which leads to poor performance in identifying shots of ‘people’. Furthermore, while our standard face detector works on individual still images, we propose using the statistics of face information across the images within a whole shot as additional evidence in deciding whether or not a video shot belongs to the ‘people’ category. Empirically, we studied which statistics of face information are more informative than others and how to combine different statistics in order to achieve better prediction.
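The idea of pooling per-frame face detections into shot-level evidence can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: the abstract does not say which statistics were used or how they were combined, so the three statistics, the function names `shot_face_statistics` and `is_people_shot`, and the hand-set thresholds are hypothetical. The work described above learns the combination, whereas this sketch uses a fixed rule.

```python
from statistics import mean


def shot_face_statistics(face_counts):
    """Summarise per-frame face counts for one shot.

    face_counts: list of ints, the number of faces a detector reported
    in each sampled frame of the shot (hypothetical input format).
    """
    if not face_counts:
        raise ValueError("a shot needs at least one sampled frame")
    n = len(face_counts)
    return {
        # Average number of detected faces per frame.
        "mean_faces": mean(face_counts),
        # Largest number of faces seen in any single frame.
        "max_faces": max(face_counts),
        # Fraction of frames in which two or more faces were detected.
        "frac_multi_face": sum(1 for c in face_counts if c >= 2) / n,
    }


def is_people_shot(face_counts, min_frac=0.3, min_max_faces=2):
    """Fixed-threshold stand-in for a learned combination of statistics.

    The thresholds are illustrative assumptions, not values from the paper.
    """
    stats = shot_face_statistics(face_counts)
    return (
        stats["max_faces"] >= min_max_faces
        and stats["frac_multi_face"] >= min_frac
    )
```

With these assumed thresholds, a shot whose sampled frames yield counts `[0, 2, 3, 1]` is labelled as a 'people' shot, while a single spurious multi-face frame in an otherwise empty shot is not. Pooling over the shot is what lets a few missed detections in individual frames be tolerated.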
Similar resources
A Novel Face Detection Method Based on Over-complete Incoherent Dictionary Learning
In this paper, face detection problem is considered using the concepts of compressive sensing technique. This technique includes dictionary learning procedure and sparse coding method to represent the structural content of input images. In the proposed method, dictionaries are learned in such a way that the trained models have the least degree of coherence to each other. The novelty of the prop...
Neural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features
This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...
Audio-visual synchrony for detection of monologues in video archives
In this paper we present our approach to detect monologues in video shots. A monologue shot is defined as a shot containing a talking person in the video channel with the corresponding speech in the audio channel. Whilst motivated by the TREC 2002 Video Retrieval Track (VT02), the underlying approach of synchrony between audio and video signals are also applicable for voice and face-based biome...
The Effect of Web-based Flipped Classroom Approach on Learning and Satisfaction of Medical Students Comparison with Lecture-based Method
Introduction: Student-centered educational models, such as Flipped classrooms, seem to provide more educational opportunities for learners, especially when combined with web technology. This study aimed to evaluate the effectiveness and satisfaction of medical students with the web-based Flipped classroom method in comparison with the lecture-based teaching method. Method: This is a quasi-exper...
Unsupervised Approach for Retrieving Shots from Video
Acquiring the video information based on user requirement is an important research, that attracts the attention of most of the researchers today. This paper proposes an unsupervised shot transition detection algorithm using Autoassociative Neural Network (AANN) for retrieving video shots. The work further identifies the type of shot transition, whether abrupt or gradual. Keyframes are extracted...
Publication date: 2003